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A multi-hop synchronous "wirelss net"work is said to be unknown if the nodes have no kno'wl- 

edge of the topology. A basic task in -wireless net"work is that of broadcasting a message (created 

^^ [ by a fixed source node) to all nodes of the net"work. Typical operations in real-life "wireless 

^^ ' networks is the multi-broadcast that consists in performing a set of r independent broadcasts. 

C/3 , The study of broadcast operations on unkno"wn "wireless network is started by the seminal paper 



o 



O 



X 



Abstract 



O ■ of Bar- Yehuda et al [BGI87| and has been the subject of several recent works. 

In this paper, we study the completion and the termination time of distributed protocols for both 
the (single) broadcast and the multi-broadcast operations on unknown networks as functions 
of the number of nodes n, the maximum eccentricity -D, the maximum in-degree A, and the 
congestion c of the networks. We establish new connections between these operations and 
some combinatorial concepts, such as selective families, strongly-selective families (also known 
r — ' as superimposed codes), and pairwise r-difjerent families. Such connections, combined with a 

f^ I set of new lower and upper bounds on the size of the above families, allow us to derive new 

lower bounds and new distributed protocols for the broadcast and multi-broadcast operations. 
In particular, our upper bounds are almost tight and improve exponentially over the previous 
c/2 ■ bounds when D and A are polylogarithmic in n. Network topologies having "small" eccentricity 



and "small" degree (such as bounded-degree expanders) are often used in practice to achieve 
efficient communication. 
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1 Introduction 



1.1 Wireless networks 



Static ad-hoc wireless networks (in short, radio networks) have been the subject of several works in 
recent years due to their potential applications in scenarios such as battlefields, emergency disaster 
relief, and in any situation in which it is very difficult (or impossible) to provide the necessary 
infrastructure [ [R96| , WNEOO| | . As in other network models, a challenging task is to enable fast 
communication. 

A radio network is a set of radio stations that are able to communicate by transmitting and 
receiving radio signals. A transmission range is assigned to each station s and any other station 
within this range can directly (i.e. by one hop) receive messages from s. Communication between 
two stations that are not within their respective ranges can be achieved by multi-hop transmissions. 
A useful (and sometimes unavoidable) paradigm of radio communication is the structuring of 
communication into synchronous time-slots. This paradigm is commonly adopted in the practical 
design of protocols and hence the use of the paradigm in theoretical analysis is well motivated 
| BGI87] , [G^ , |R7^ . 

A radio network can be modeled as a directed graph where an edge {u, v) exists if and only if 
u can communicate with v in one hop. The nodes of a radio network are processing units, each 
of which is able to perform local computations. It is also assumed that every node can perform 
any local computation required for deciding the next send/receive operation during the current 
time-slot. In every time-slot, each node can be active or non active. When active, it can decide to 
be either transmitter or receiver, in the former case the node transmits a message along all of its 
outgoing edges while, in the latter case, it tries to recover messages from all its incoming edges. 
When it is not active, it does not perform any kind of operation. The fundamental feature here is 
that a node v can recover a message from one of its incoming edges if and only if this edge is the 
only one bringing in a message. If two or more neighbors of a node are transmitting at the same 
time-slot then a collision occurs. Nodes do not distinguish between the background noise and the 
interference noise (i.e., we are in the case of absence of collision detection PL95 , BGI87[| ). A radio 
network is said to be unknown when every node knows nothing about the network but its own 
label IBGI87I , pGGPROO| , ICGOROOJ ICGROOJ . Informally speaking, unknown radio networks, with 
absence of collision detection, model communication networks with multi-access channels in which 
the assumptions on the processors' knowledge and on the channel are minimal. In fact, in several 
applications, the network topology is unstable or dynamic and it is difficult to distinguish the 
presence of a collision from the background noise of the channel. Another important motivation 
in studying unknown networks comes from its strong connection with the fault-tolerance issue 
PR97| , [KKP98| , KM9^ , CMSOlc]. Unknown networks can also be seen as "known" networks (i.e. 
networks in which nodes have the knowledge of the entire initial topology) with unknown permanent 
faults. A node (or an edge) is said to suffer a permanent fault if it is never active during the entire 
execution of the protocol |]KKP9S , CMS01c|| . It is easy to show that a broadcast protocol for 
unknown fault-free networks is also an (unknown permanent-)fault tolerant protocol for general 
(known) networks and viceversa. So, the results obtained in the unknown model immediately 
apply to the permanent-fault tolerance issue. 

One of the fundamental tasks in network communication is the broadcast operation. It consists 
in transmitting a message from one source node to all the nodes. 



According to the network model described above, the communication protocol operates in 

time-slots: at every time-slot, each active node decides to either transmit or receive, or turn into 

the non active state. Two kinds of broadcast protocols have been considered in the literature 

BGI87 , BD97 , pGGPROO , CGOR00 |: spontaneous protocols, in which the starting time-slot is 



known to all the nodes and, thus, every node can transmit even if it has not received any message 
in previous time-slots; non spontaneous protocols in which a node (which is not the source) may 
act as a transmitter in a time-slot only if it has received a message in some previous time-slots 
(while the source starts at time-slot 0). A deterministic (randomized) broadcast protocol is said to 
have completed broadcasting when all nodes, reachable from the source, have received (with high 
probabilityFI) the source message. Notice that when this happens, the nodes not necessarily stop 
to run the protocol since they might not know that the operation is completed. We also say that a 
broadcast protocol terminates in time t if, after the time-slot t, all the nodes are in the non active 
state (i.e. when all nodes stop to run the protocol). 

The completion and termination time of Deterministic (Randomized) Broadcast protocols, in 
short DB (RB) protocols, will be analysed as functions of the following parameters of the network: 
the number n of nodes, the maximum in-degree A, and the maximum eccentricity D over all 
possible source nodes. Given a source node s, the eccentricity of s is the largest distance between 
s and any node of the network. Observe that the maximum eccentricity equals the diameter in the 
case of symmetric networks. 

A typical task in real-life radio networks is that of performing a set of simultaneous and inde- 
pendent broadcast operations: a multi-broadcast operation is to perform r > 1 broadcasts (from an 
arbitrary multiset of source nodes). The completion time of a Deterministic (Randomized) multi- 
Broadcast protocol, in short multi-DB (multi-RB) protocols, is defined as follows. A multi-DB 
(multi-RB) protocol on a radio network has completion time t if, (with high probability) every 
broadcast message is received by all the nodes reachable from the source of the message within the 
first t time-slots. The termination time of a multi-DB (multi-RB) protocol is defined as for DB 
protocols. 

As for the channel bandwidth, we distinguish two models. In the Unbounded- Bandwidth (in 
short UB) model [|CGROO |, a node can send/receive messages of unbounded size (so, a node can 



send an arbitrary large subset of the r messages in one time-slot). In the Bounded- Bandwidth (in 
short BB) model [ BII93| ], every node can send messages of size at most 0(logn + logr) in one time- 



slot. In this model, the completion time of the protocols also depends on the congestion (denoted 
as c) which is defined as the maximum number of broadcast messages that a node has to receive 
(where the maximum is computed over all possible nodes). 

1.2 Previous results 

Broadcast. Broadcasting in unknown radio networks has been introduced and studied in the sem- 



inal paper [BGI87] by Bar- Yehuda et al. They proved a lower bound 0(n) on the completion time 
of any DB protocol running on a family of unknown, symmetric radio networks of diameter 3. They 
also provide a non spontaneous RB protocol having expected completion time 0((-D + logn) logn)). 
Thus, the Bar- Yehuda et aVs work represents an important example in which randomized compu- 
tations have been proved to be exponentially faster than deterministic ones. On the other hand, 



their randomized protocol does not terminate when no upper bound on n is known. In [ABLP8E], a 



^A formal definition of completion time for randomized protocols will be given later. 



lower bound J7(log^ n) is shown for RB protocols that holds even for graphs of constant eccentricity 
(and diameter). The best known general lower bound for RB protocols is il.{Dlog{n/D)), obtained 
in p<M93| ] ■ As for non spontaneous DB protocols, Bruschi and Del Pinto [ BD97| ] obtained a lower 
bound r2(L)logn) for symmetric networks of diameter D. Moreover, an equivalent lower bound for 



spontaneous DB protocols on directed networks has been proved by Chlebus et al |CGGPROO|. The 
first positive results on DB protocols in unknown radio networks have been presented in DKKP99| ; 
however, this paper only studies radio networks having restricted topologies. In [pGGPROC |, a DB 
protocol for symmetric unknown networks is presented that has 0{n) completion and termination 



time thus matching the lower bound in [ BGI87 |, and a DB protocol for general unknown networks 
that has 0{n}^'^) completion and termination time. This protocol is the first one that makes 
an explicit use of selective families: as we will see later, such families play a crucial role in the 
techniques introduced in this paper. 

By means of a better use of selective families, more efficient DB protocols have been obtained in 
CGOROC]. They obtain a protocol having 0{n^'^) completion and termination time; for networks 
of maximum in-degree A = 0(n"), they obtain 0{'n}~^°'~^^^"'>~^°^^') completion and termination time 
(where H{a) denotes the binary entropy function). They also present a DB protocol having 0{'n?''^) 
completion and termination time and a DB protocol having 0(nA^ log n/ log(A log n)) completion 
time working on unknown networks of maximum in-degree A. 

The best presently known deterministic upper bound for general unknown networks is 0(n log n) 
and is obtained by means of a not efficiently constructible DB protocol introduced by Chrobak et 
aim ICGROq ]. 

It thus turns out that all previous deterministic upper bounds are superlinear (in n) indepen- 
dently of the parameters D and A of the network. A simple analysis of such protocols show that 
they in fact runs in superlinear time even when D and A are bounded by a fixed constant. In sev- 
eral wireless network applications, network topologies with "small" diameter and "small" degree 



are typically adopted in order to achieve fast and reliable communication | AS92 , Lei91 , KR96 |. It 
is thus natural and well-motivated to investigate the complexity of broadcast operations restricted 
to such topologies: the "hope" is to obtain protocols which run faster than the previous "general" 
protocols. 



Multi-broadcast. As for the UB model, Chrobak et al [ CGROO| provide a multi-DB protocol for 
the gossiping operation (i.e., the special case of n simultaneous broadcast operations, each starting 
from one different node) that has 0(n^'^) completion time. Multi-broadcast in the BB model has 
been studied in | BII93|| , where a randomized distributed protocol is presented that performs the 
broadcast of r messages in 0((£) + r) log A log n) completion time. However, this protocol does not 
work on unknown networks: it indeed assumes that every node knows its respective neighborhood 
and that the network is symmetric. Moreover, it requires a set-up phase in which a Breadh First 
Search tree is computed in 0{{n + D log n) log A) time-slots. 



1.3 Our results 

Broadcast. Our first contribution is the construction of a family of graphs that yields an fl{n log D) 
lower bound on the completion time of DB protocols on unknown networks. The above result applies 
to both spontaneous protocols on directed networks and non spontaneous protocols on symmetric 
networks. The lower bound given in | CGGPRO0|] and |BD97] (i.e. J7(Dlogn)) improves over the 
linear lower bound given in |BGI87] for D > 3 only when D is "almost" linear, i.e., D = uj{n/logn). 



Instead, our lower bound implies a superlinear number of time-slots for any D = w(l) and, 
moreover, it implies that the 0(n log n) deterministic upper bound given in [ CGROOl is almost 
optimal when D = 0(n"), for any constant a > 0. We emphasize that our lower bound also holds 
when every node knows n and A. A simple variant of our family of graphs allows us to get the 
first lower bound that also depends on A: we indeed provide an J7(DAlog(n/A)) lower bound that 
holds for any A < n/D. This lower bound implies that the bound 0(nlog n) given in | CGROC ] is 
almost optimal whenever DA = il(n). 

On the other hand, we provide a new broadcast technique that yields the first (non constructive) 
DB protocols having a completion-time that does not contain n as linear factor but only D and 
A. More precisely, we obtain an 0(DAlog(n/A) log "*"" n) upper bound, where a is any fixed real 
positive constant. 

Our protocols are thus not efficient when DA = a;(npolylogn). However, by comparing them 
with our Q(DAlog(n/A)) lower bound, we can see that these upper bounds are almost opti- 
mal when A = 0{n/D). This solves an open problem posed in [CGOR00|. Furthermore, when 
D,A = O(polylogn), this is an exponential improvement over the superlinear (in n) completion 
time obtained by the best previously known (not efficietly constructible) DB protocols |CGROO[ . 

Another interesting consequence lies in a new insight into the real gap between randomization 
and determinism in radio broadcasting. Indeed, rather surprisingly, if we compare our deterministic 
upper bounds to the 0{{D + log n) log n) upper bound obtained by the Bar- Yehuda et a/'s RD 
protocol [|BGI87| ], we can easily state that the exponential gap between deterministic protocols 
and randomized ones holds, at least from a theoretical view point, only for radio networks having 
"large" maximum in-degree (i.e. when A = ^{n^), for some /? > 0). 

The following table summarize previous and our results for the broadcast operation on unknown 
networks. 





Previous results 


Our results 
Deterministic 




Deterministic 


Randomized 


Lower bound 


n{Dlogn) 


n{Dlog{n/D)) 


17(DAlog(n/A)) for A < n/D 


Upper bound 


0(nlog n) 


0((D + logn)logn) 


0(DAlog(n/A)log^+°n) 



Multi-broadcast. Let us first consider the BB model. By combining the trivial lower bound Q{D) 
with the fact that a node cannot receive more than one message per time-slot, it is easy to derive 
an r2(D + c) lower bound for the multi-broadcast operation for both randomized and deterministic 
protocols (observe that, in the UB model, we can only get r2(D) since the congestion c is always 
1). On the other hand, we are not aware of any lower bound of the form i}(f{c) ■ g{n)) where 
both / and g are some unbounded functions. Such a kind of lower bounds is important since it 
implies that a "perfect pipeline" protocol (i.e., a protocol yielding an 0{SB{D, n) + c) upper bound, 
where SB{D,n) is the best upper bound available for the broadcast operation) is not achievable. 
We provide the first lower bound of the kind defined above, even under very restrictive topology 
conditions. We indeed derive a family of graphs with A = 2 that forces any multi-DB protocol to 
perform at least Q(c+([^ + D) logn) time-slots to complete multi-broadcast operations. Then, we 
derive an Q{c+ j^ logn + Dlog -g) lower bound to multi-RD protocols. Hence, perfect pipelining 
is not achievable even with the help of distributed random choices. 

We observe that the above lower bounds also hold in presence of collision detection and when the 
nodes know n and/or A. 



On the other hand, we combine a variant of our (single) broadcast technique with a suitable 
"local" scheduling (that solves the congestions arising inside every node) in order to get a multi-DB 
protocol for the BB model. This protocol has 0{{D + c)A^ log "*"" n) completion time (where a is 
any fixed real positive constant) and it can be converted into an efficiently constructible one having 
0((D + c) A^ log^^° n) completon time. 

By comparing the above upper bounds with our deterministic lower bound, we have that our multi- 
DB protocols turn out to be "almost" optimal (i.e., only a polylogarithmic factor away from the 
lower bound) when A = O(polylogn). We also emphasize that, for A = 0(1), our deterministic 



upper bound is almost equivalent to the 0{{D + r) log A log n) randomized upper bound BII93 | 
in which it is even assumed that the network is symmetric and the nodes know their respective 
neighborhood. 

As for the UB model, since arbitrary large concatenation of the messages inside a node can 
be sent along the outgoing edges, we can use a simpler version of our multi-DB protocols. In this 
version, the local scheduling is not required and, thus, we get an 0(L)A^log ^'^ n) upper bound. 

1.4 Organization of the paper 

Section g provides an overview of the connections between the issue of radio broadcasting and 
some combinatorial concepts and results. In Section ^, the proofs of such combinatorial results 
are given. Section § describes the results on the broadcast operation. In Section |^, the results on 
the multi-broadcast operation are presented. Finally, Section ^ discusses the obtained results and 
proposes some open problems. 

2 Adopted techniques and some new combinatorial results: an 
overview 

The proofs of our upper and lower bounds exploit some new combinatorial results that we believe 
to have a per se interest. In this section, we provide a description of such results and we outline 
their connection with broadcast and multi-broadcast operations. 



In [CGGPROO], Chlebus et al introduced the use of selective families in designing DB protocols 



in unknown networks. In what follows, the set {1, . . . ,n} is denoted as [n]. 

Definition 2.1 Let n and k be any integers with k < n. A family T of subsets of [n] is {n,k)- 
selective if, for every non empty subset Z of [n] such that \Z\ < k, there is a set F in T such that 

\zr\F\ = 1. 

In fact, selective families can be used to design oblivious DB protocols. A protocol is oblivious 
when its actions can be scheduled in advance. Let T = {-Fi, . . . , Fm} be an (n, A)-selective family. 
Then, it is easy to define an oblivious DB protocol on a network of n nodes and maximum in-degree 
A (this protocol has been introduced in jCGGPROq , |CGOR00|| ). 



A node u transmits at time-slot i < m iff it has received the source message (i.e. it is 
informed) and u £ Ff. 

During the execution of these m time-slots, thanks to the selective property of J-, for each set of 
d < A nodes, there is at least one time-slot in which only one of the nodes of the set can transmit. 



This guarantees that at least one non informed node gets informed. By iterating this process n 
times, the broadcast is completed. Thus the completion time of such DB protocol is n\J^\. 

We prove that there exist (n, fc)-selective families of size 0(A;log(n/A;)). We also prove that this 
upper bound is optimal, that is, any (n, /c)-selective family has size i^{klog{n/k)). On one hand, 
such small selective families are combined with a new broadcast technique in order to obtain our 
DB protocols. On the other hand, the lower bound on the size of selective families allows us to 
obtain lower bounds on the completion time of DB protocols. 

In designing multi-DB protocols, we introduce families of sets having a stronger selective prop- 
erty. 

Definition 2.2 Let k < n. A family T of subsets of [n] is (n, A;)-strongly-selective if for every 
subset Z of [n] such that \Z\ < k and for every element z G Z there is a set F in J^ such that 
ZnF = {z}. 

If we have at hand an (n, A + l)-strongly-selective family JT = {Fi, . . . ,Fm}, then it is easy to 
define an oblivious multi-DB protocol for the UB model on a network of n nodes and maximum 
in-degree A. A node u transmits (all the messages it knows) at time-slot t < m iS u £ Ff. By 
the strongly-selective property of J^, for each set X of d < A + 1 nodes and for each node u G X, 
there is at least one time-slot in which only u transmits among the nodes in X. This guarantees 
that every message reaches at least a new node during the m time-slots. By iterating this process 
n times the multi-broadcast is completed in n\J^\ time-slots. 

Similarly to our single DB protocol, our multibroadcast protocol exploits the existence and the 
construction of strongly-selective families of small size. Actually, strongly-selective families are a 
new appearance of a well-known notion. Let J^ = {Fi, . . . , Fm} be a family of subsets of [n] and 
consider the matrix M of m rows and n columns where Mf^^ is set to 1 iff i G -Fj. It turns out that 
J^ is an (n, A:)-strongly-selective family iff the OR of any set of at most k columns of M covers only 
the columns of the set. Hence the columns of M-^ form a superimposed code |KS64|. Superimposed 



codes are also known in combinatorics as cover free families [ EFF85| ]. Our protocol uses the (n, A;)- 



strongly-selective families of size 0(min{n, /c^ logn}) whose existence is proved in [ EFF85|] . This 
existence proof does not provide an efficient construction. However, efficient construction of such 
families can be found in |KS64|: in this case the size is 0(min{n, fc^ log n}). 



Since a lower bound on the size of strongly-selective families determines a lower bound on the 
completion time of our multi-broadcast technique, we have also investigated this combinatorial 
aspect. In |CR96], Chaudhuri and Radhakrishnan obtain a lower bound ri((/c^/ log fc) log n) for 
sufficiently large k such k < n^'^. Our contribution here is the extension to every k of that bound, 
that is, [7(min{n, (fc^/log A;) logn}). This implies that there are no significantly smaller strongly- 
selective families than those adopted by our protocols. 

As in the case of selective families, we tried to exploit strongly-selective families to obtain good 
lower bounds on the completion time of multi-DB protocols. But we could not obtain anything 
better than the lower bound for single broadcast. 

In the BB model, besides the interference problems, the bound on the channel bandwidth yields 
further delays due to the congestion inside the nodes of the network. We exploit this aspect to 
obtain a non trivial lower bound. Consider the following situation that can happen during the 
execution of a multi-broadcast protocol. There are n nodes each of them having r messages. They 
are not connected each other, and exactly two of them are the only in-neighbours of another node 
u. Thus, all the r messages can be received by u only if, for every pair of nodes and for each 



message, there is a time-slot in which exactly one of the nodes of the pair transmits that message 
and the other does not transmit anything. This suggests to introduce the following notion. 

Definition 2.3 Two sequences x and y of equal length over the alphabet {0} U [r] are r-different if 
for any z £ [r] there is a coordinate i for which {xj,yj} = {2;,0}. 

From the situation described above, we will show that a protocol which performs any multi- 
broadcast operation on unknown networks within t time-slots must yield a set of n pairwise r- 
different sequences of length not greater than t. By combining this connection with a new lower 
bound on the length of such sequences, i.e. 0((r/logr) logn), we derive the lower bounds for the 
multi-broadcast operation on the BB model. 

3 Combinatorial results 

3.1 Selective families 

We now show the existence of (n, A:)-selective families of small size by a suitable application of the 



probabihstic method [|AS92 | 



Theorem 3.1 For any n > 2 and k>2, there exists (n,k) -selective family of size 0{klog(n/k)). 

Proof. In the sequel we say that a family J^ is selective for a family S if, for each S £ S, there is 
a set F £ T such that |F n 5| = 1. Let Si, 1 < i < [log fc] , be the family of all the subsets of [n] 
having size in the range (2*~^,2*]. Consider now a family J^i of li sets (the value of li is specified 
later) in which each set is defined by randomly picking every element of [n] independently, with 
probability i. 
Fix a set S £ Si and consider a set F £ Ti] then it holds that 

LI ' J 2* V 2V 2* V 2V - 4-2* - 8 

where the second inequality is due to the fact that (! — ]■) >|fort>2. 

The sets in Ti have been constructed independently, so, from the above inequality, the probability 

that J-i does not select S is at most 

1\'» h 



< e 8 
Hence we have that 



2' / 

Pr[.Fj is not selective for Si] < ^J Pr[-^i doesn't select S] < ^J 

S&Si d=2'-i+l V 




By choosing /, > 81n ^_^ 2M , we get 



2^-1 1 



Since log 



n 
t 



0{tlog{n/t)), it holds that k = 0(2' log(n/2*)) thus there exists a family Ti 



selective for Si and having size 0(2Mog(n/2*)). Finally, we consider the (n, fc)-selective family 
whose size is 



[log fc] 

u ^^ 



[log k] 

^ 0(2Mog(n/2')) = 0(A;log(n/A;)). 



D 



In what follows, we provide a lower bound on the size of selective families. To this aim, we 
make use of the notion of intersection free family 

Definition 3.1 Let I < k < n. A family T of k-subsets of [n] is (n. A:, /)-intersection free if 
I Fi n i<2 1 7^ ^ for every Fi and F2 from T . 

Roughly speaking, the intersection free property is somewhat "complementary" to the selectivity 
property we are using in this paper. So, even though an explicit mathematical connection be- 
tween the two properties will be determined later, the reader can already imagine our interest in 
introducing the following result obtained by Frankl and Fiiredi. 



Theorem 3.2 | FF85 ] Let T he an (n, k, I) -intersection free family where 21 + 1 > k and k — I is a 
prime power. Then it holds that 



m< 



n 
I 



2k -I -I 
k 



2k -I -I 
I 



In particular, we first prove the following consequence of the above theorem 

Corollary 3.1 Let T be an (n, k, 2) -intersection free family where k is a power of 2 and k < -^ 



Then it holds that 















1 1^1 -Lift 
log 1^1 < ^2 


1 '^ 






Proof. 


By 


using 


the following 


inequalities involving 


binomial coefficients 










b 
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(a' 


^ IT ' 


a-l\ 
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a-b 
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we obtain 



















log |J^| < log 



n 

k/2 



3k/2 -l\\ 

_± ] 

?,k/2 - 1 ^ 
k/2 



log 



n 

k/2 



■ik/2 \ \ 

3k/2 \ 
k/2 J 



< 



k n k 3k k k n 5 ^ Ilk n 

2log^ + -log3 + -loge---l<-log- + -fc<— log- 

D 

We are now ready to prove the lower bound. 

Theorem 3.3 For any n > 2, let T he an (n, k)-selective family with 2 < k < ■^. Then it holds 

that 

, , k n 

Proof. Let k', ^ < k' < k, he a power of 2. Let x{G) be the chromatic number of the graph 
G whose vertices are all the /c'-subsets of [n] and whose edges connect vertices having exactly y 
common elements. The theorem is an immediate consequence of the following inequalities 

logxiG) > Aiog^ (1) 

l-^l>logx(G'). (2) 

We first prove Ineq. |l|. For any graph G{V, E) with stability number a{G) it holds that 

X(0) > J^. (3) 

Clearly a stable set of vertices in G forms an {n,k', Y)-intersection free family satisfying the 



conditions of Corollary 3.L hence, from Ineq. H and Corollary 3.1, we have that 



(n \ Ilk' n 
k' )"l2"^°^fe^- 

, n Ilk' , n k' n k n 

> k log log — = — log — > — log — . 

^ k' 12 ^ k' 12 ^ k' - 2A ^ k 

We now prove Ineq. g. Here we use the straightforward inequality 

X{(}G.)<\{X{G.) (4) 

that holds for any set of graphs having the same set of vertices. 

Let be J^ = {Fi, • • • , F\^\]. We define the graph Gi,l<i< |J^|, by setting V{Gi) = V{G) and by 
drawing an edge between two vertices of Gi if they are adjacent in G and furthermore |Fj H X| = 1, 
where X is the symmetric difi^erence of the sets corresponding to the two vertices. Since ^ is a 
(n, /c)-selective family and the symmetric difference of these sets has cardinality k' , for any edge of 
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G, there will be at least a graph Gi having this edge. Hence we have G = U^^^G,. It thus follows 

that 

l-^l l-^l l-^l 

logx(G) = logx(U Gi) < lognx(G.) =Y,logx{G,) < |^| 

i=l i=l i=l 

where the first inequality follows from Ineq. ^ and the last inequality follows by noting that the 
graphs Gi are bipartite graphs (i.e. x{Gi) < 2): indeed, for any two adjacent vertices in Gi one 
has odd intersection with the elements of Fi and the other has even intersection. □ 



3.2 Strongly-selective families 



In [DR83], Dyachkov and Rykov proved a lower bound il(cytlogn) on the size of (n, /i:)-strongly- 
selective families, where Ck — > Q{k'^ / \ogk) for k — > oo. Observe that this does not imply the 
standard two-variable lower bound r2((A;^/logfc) logn): for instance when k = r2(n), this would 
imply a lower bound O(n^). The latter is clearly false. Indeed, the family consisting of all the 
singletons from [n] is (n, /i;)-strongly-selective for any k = 1, . . . ,n, and it has size n. In | CR96|| , 
Chaudhuri and Radhakrishnan obtain a lower bound (/c^ log n)/ (100 log A;) for sufficiently large k 
such thatg k < n^'^. Our contribution here is the generalization (and an improvement) of the 
Chaudhuri and Radhakrishnan's result. 

We prove a lower bound that is only an 0(logA;) factor away from the 0(min{n, A;^ logn}) 
bound in [|EFF85|]. 



Theorem 3.4 Let T he an {n,k)- strongly- selective family. 
i)* If 'i ^ k < \/2n — 1 then it holds that \!F\ > ^g^^— ^logn. 
ii). If k > v2^ then it holds that \J^\ > n. 



Proof, i). The proof relies on a result by Fiiredi |F9€] and a result by Bassalygo [DR83| on 
superimposed codes. For the sake of convenience, we state such results in terms of strongly-selective 
families. 

Let J- be an (n, fc)-strongly-selective family then Bassalygo proved that 



\J-\ > mm < ,n 



(5) 



and Fiiredi proved that, for A; > 3, 



n < A- 1 + 



/ 


1^1 
( k\ 


\ 


\ 


[') 


J 



(6) 



Notice that the conditions on k are stated immediately before Lemma 5.1 of [CR96] 
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Let 3 < A; < \/2n — 1. From Ineq. |6| and the inequality < (^) > we get 



_i I ]r^,r(r, - h J- ^\ <^ 9\ T\ - -^h J- •? J- h'^ 

^^g el^lfc(fc-l) 



^ log(n -k + l)< 2|J^| -3k + 2 + k^ 



2[\r\-k+l) 

Since -^ < k{k — 1) < k"^ and -yn < n — A; + 1, it follows that 

k^ 



4iog^0^ 



loen< 2|:F| - 3A; + 2 + /c^ 



2(|:F|-fc+i) 
Moreover, since k < \p2Jn — 1, Ineq. S implies that 

and then 



2(|.F|-A; + 1 
We thus obtain 

k^ 



' ' < 1 and - 3/e + 2 + A:^ < 2\T\ 



41og(eA;2) 



logn< 4|J^|. 



Finally, since /c > e, \T\ > ^^T^ ^^^n. 

ii). When k > \'2n, the thesis follows immediately from Ineq. |5[ □ 

3.3 Sets of pairwise r-different sequences 

Our goal in this section is to prove a lower bound on the length of n sequences which are pairwise 

r-different. In the sequel, we will make use of the binary entropy function h{t) = —tlogt — (1 — 

i)log(l-t). 

The proof of our lower bound relies on the following nice theorem proved in FKOO|| 



Theorem 3.5 |FKOO| Let S be a subset of (^2 ) ^.''^d C be a set of sequences of length m over the 
alphabet [r] with the property that for each {x,y} G (2) and {a,b} G S there exists an i ^ [m] such 
that {xi,yi} = {a,b}. Then it holds that 

log \C\ < TTimax min < (pa + Pb)h 

where, in the maximum, P is running over all the probability distributions on [r\ . 

Theorem 3.6 Let M{n, r) denote the minimum length of n sequences which are pairwise r- 
different. Then 

M{n, r) = q( -^ log n ) . 
Vlogr J 
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Proof. Let C be a set of n sequences which are pairwise r-different and define S = {{0, i}\i G [r]}. 



From Theorem 3.5| we have that: 

Pi 



log n < M(n, r) max min < (po + Pi)h 

P ie[r] [ \Po+Pi 

Let fix) = (po + x)h (^) , < X < 1. We have that f'{x) = log 2o±2 > for any < x < 1. 
Thus / is not decreasing and we can restrict the search of the maximum value, in the right hand 
of the above inequality, to those probability distributions in which pi have the same value for all 
i G [r], i.e., pi = (1 -po)/r. 

We thus consider for any x £ [0, 1], the probability distribution pi = f ior i £ [r] and pQ = 1 — x. 
Then the inequality can be written as: 



log n < Min, r) max < \ 1 — x -\ — ] h 
'xe[o,i] V rj 



In order to prove the theorem, we show that 



Indeed, the function 



max fl-x + i^)J-i-, =0^^°^^ 
^e[o,i]V rJ \l-x + f/J 



f{x) = [l-x + -)h( 



rj'\l — x+- 

can be written as 

/ X \ X r-rx + x 

f(x) = 1 - x log 1 + — + - log . 

V r(l — X) J r X 

Then, by using the well known inequality 1 + 1 < e* (that holds for any real t), we get: 

(1 — x) log ( 1 H — r ) < (1 — x) log e'"(i-^) < - log e < - log e = O ( - 

V r(l — x) J r r \r 

It thus suffices to prove that 



,, X- r — rx + X ( log r 

g[x) = - log = O 



r X \ r 

Since '^-'^^+^ ig a decreasing function in the interval set [-^f^, 1] then, for x G [^f^y 1]) 

X 1 / log T 

g{x) < — log2r < -log2rGO 



Furthermore, 

, 1 r — rx + X 1 

g[x) = - log ■ — — — 

r X (r — rx + xj in 2 

is strictly positive in the interval set [0, g/lj^ ]. Thus, in [0, 3^777], the function g{x) is increasing. 
Hence, for x G [0, 37:771], 

This completes the proof. □ 
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4 Broadcast operations 

4.1 The lower bounds 

In this section, we show the existence of an infinite family of directed graphs that force any DB 
protocol to perform, in the worst-case, fl{nlog D) time-slots before completing a broadcast. Then, 
we provide a simple variant of this family of graphs yielding a lower bound that also depends on 
A. Our lower bound holds for the UB model (and, thus, for the BB model too). We first formalize 
the notion of DB protocol according to [ BGI87| . 



Definition 4.1 A Deterministic distributed Broadcast DB protocol P is a protocol that works in 
time-slots (numbered 0, 1, . . .) according to the following rules. 

1. In the initial time-slot a specified node (i.e. the sourcej transmits a message (called the source 
message^. 

2. In each time-slot, each node either acts as transmitter or as receiver or is non active. 

3. A node receives a message in a time-slot if and only if it acts as receiver and exactly one of 
its in-neighbors acts as transmitter in that time-slot. 

4. The action of a node in a specific time-slot is a function of its own label, the number of the 
current time-slot t, and the messages received during the previous time-slots. 

Theorem 4.1 For any DB protocol P, for any n and for any D < n/6, there exists an n-node di- 
rected graph G^ of maximum eccentricity D such that P completes broadcasting on G^ in 0(n log D) 
time-slots. The lower bound holds even when every node knows n. 

Proof. The graph G^ is a layered n-node graph with D + 1 levels Lq, Li, . . . , Ld; Level Lq contains 
only the source s, level Lj has no more than [n/{2D)\ nodes for j = 1, . . . ,D — 1 and, finally, the 
level Ld contains all the remaining nodes. All nodes of Lj-i have outgoing edges to all nodes in 
Lj. As we will see later, the actions specified by P determine the node assignment in the levels 
j > 1 in such a way that the protocol is forced to execute 0,{{n/D)logD) time-slots in order to 
successfully transmit the source message between two consecutive levels. This assignment will be 
performed by induction on the levels. 



Prom Theorem 3^, there exists a constant c > such that, if 2 < D < n/6, any ([n/2] , \_n/{2D)\)- 
selective family must have size at least T, where T = [c-^ log D\ . 
The theorem is then an easy consequence of the following 

Claim 1. For any j = 0, . . . ,D — 1, it is possible to assign nodes in Lq, Li, . . . ,Lj 
in such a way that P does not broadcast the source message to level Lj before the 
time-slot j ■ T. 

Proof. The proof is by induction on j. For j = 0, the claim is trivial. We thus 
assume the thesis be true for any j and we prove it for j + 1- Let us define 

R = {nodes not already assigned to levels Lq, . . . , Lj} 
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Notice that \R\ > \n/2]. In fact 



' (D - 2) - 1 > 



\R\ = n — 2J \Lh\ > n 

h=0 



n 



2D 



Let L be an arbitrary subset of R. Consider the following two cases: i) ^j+i is 
chosen as L, and ii) ij+i is chosen as R (i.e. all the remaining nodes are assigned 
to Ljj^i). In both cases, the predecessor^ subgraph G^ of any node u E L is that 
induced by Lq U Li U . . . -^j+i U {u} in G^ . It follows that the behaviour of node 
u, according to protocol P, is the same in both cases. We can thus consider the 
behavior of P when Lj+i = R- Then, we define 

Ft = {u ^ R\ u acts as transmitter at time-slot j -T + t}. 

and the family JT = {Fi, . . . ,Ft-i} of subsets from R. Since \J^\ < T, !F \s not 
([n/2], [n/(2D)J)-selective; so, a subset L C R exists such that \L\ < [n/{2D)\ and 
L is not selected by J-' (and thus by P) in any time-slot t such that jT + 1 < t < 
{j + 1)T — 1. The proof is completed by choosing Lj+i as L. 



n 



Theorem 4.2 Let P be a DB protocol. Then, for any n, for any D < n/6, and for any A < n/D, 
there exists an n-node directed graph G^ of maximum eccentricity D and in-degree bounded by A 
such that P completes broadcasting on G in 0(DAlog(n/A)) time-slots. The lower bound holds 
even when every node knows n and A. 



Proof. The proof is based on the same construction of the proof of Theorem ^.1[ The only 
difference is that, for every j = 1,2, ... D — 1, level Lj of G consists of at most A nodes and Ld 
(consisting of all the remaining nodes) is connected to the previous level in such a way that the 
maximum in-degree is kept not larger than A. D 

4.2 The upper bounds 

This section provides a DB protocol for unknown networks. For case of exposition, we first describe 
the protocol that assume the knowledge of n and A. Then, we show how to extend the same 
technique to the cases in which A and n are not known by the nodes. 
The following protocol assumes the knowledge of A and n. 

Description of Protocol BROAD-A(n, A). The protocol uses an (n, A)-selective family J^. It 
starts by setting all the nodes to the active state, and by let s transmit the source message. After 
the first time-slot, it turns s to the non active state. Then, it performs a sequence of consecutive 
identical phases. Let us fix an arbitrary ordering for the sets of J^; at time-slot j of phase i, each 
node V acts according to the following rule: v transmits the source message along its outgoing edges 
if and only if 

■^Given a graph G, the predecessor subgraph G„ of a node u is the subgraph of G induced by aU nodes v for which 
there exists a directed path from v to u. 
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1) the label of v belongs to the j-th set of J^, and 

2) V has received the source message for the first time during the phase i — 1. 

After the phase in which a node v acts as a transmitter, it turns to the non active state (so, this 
is a first change w.r.t. the straightforward protocol described in Section g). The active nodes that, 
at at time-slot j of any phase, have a state not satisfying Conditions (1) and (2) act as receivers. 
Observe that Condition (2) is the key difference between our technique and the straightforward 
one. As we will see in the analysis of the protocol, this difference will play a crucial role in order 
to achieve an upper bound not containing the linear factor n. 

Theorem 4.3 Protocol BROAB-A{n, A) completes broadcasting and teruiinates inO{DA\og{n/A)) 
time-slots on any n-node graph of maximum eccentricity D and maximum in-degree A. 



Proof. Since, Theorem 3.1 implies that \J-\ = 0(L'Alog(n/A)), the thesis is an easy consequence 
of the following claim. 

Claim. A node v receives (for the first time) the source message at phase i of protocol 
BROAD-A(n, A) if and only if v is at distance i + 1 from the source s. 

Proof. The proof is by induction on i. {-^). For i = the Claim is obvious. We 
thus assume that all nodes at distance i have received the source message during 
phase i — 1. Let us consider a node v at distance i + 1 during phase i. This node has 
at least one informed in-neighbor at distance i. According to the protocol, only the 
neighbors of v informed in phase i— 1 will act as transmitters in phase i. Then, from 
the (n. A) -selectivity of T, there will be a step of phase i, in which only one of these 
informed in-neighbors will transmit to v. (^). If v is not at distance i -\-\ from s, 
then two cases may arise. If v is at distance less than i + 1 then, by the inductive 
hypothesis, v has been informed before phase i. Otherwise, v is at distance greater 
than i + 1, so none of its in-neighbors has been informed before phase i. 



D 



The next protocol assumes the knowledge of n. 



Description of Protocol BROAD-B(n). Each node runs a sequence of phases, each of them 
consisting of [logn] time-slots. In time-slot I {1 < I < [logn]) of phase /i, each node runs time- 
slot h of BROAD-A(n, 2 ). Furthermore, if a node v is set to the non active state in a time-slot of 
BROAD-A(n, 2') for some 1 < / < [logn] , then it will stay inactive for all the rest of BROAD-B(n). 

Theorem 4.4 Protocol BROAD-B(n) completes 6roadcasiing and terminates in 0(L'Alognlog(n/A)) 
time-slots on any n-node graph of maximum eccentricity D and maximum in-degree A. 

Proof. Since G has maximum in-degree d, the execution of BROAD-A(n, 2''*) where l^ is the 
minimum integer such that d < 2'' satisfies Claim in the proof of Theorem 4.S. Observe that 



during this execution, a node v, that satisfies the two conditions for transmitting, could be already 
in the non active state because of the execution of some BROAD-A(n, 2^) with / < l^. However, if 
this is the case, v has already successfully transmitted the source message to all its out-neighbors. 
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n 

In the third protocol, the nodes only know their respective labels. 

Description of Protocol broad'"' (a > 1). Informally speaking, this protocol consists in 
running BROAD-B(n) with n = 2 , for i = 1,2,... One of these executions will be the "good" 
one. However, applying a direct "dovetail" scheduling would result into a completion time of 
O ((-DAlognlog(n/A))^) (recall that nodes do not know n). So, in order to bound the extra-time 
by a factor of 0(log° n), Protocol broad^"' executes different applications of broad-b(-) according 
to a more sophisticated dovetail technique. Consider the following family of functions: 



f^iz) = 0, f^{z) 



2. 



{k- z), k = 1,2,3,... 



Protocol broad^") consists of a sequence of phases, denoted as phase(A;), k = 1,2,3,... The 
phase(A;) is in turn formed by k stages: in STAGE(k,i) (with I = 0,1, . . . ,k — l), the nodes execute 
the time-slots 

A«_i(£) + l,A"_i(£) + 2,..., /,"(£) 

of broad-b(2^). If a node v is not active in a time-slot of broad-b(2^), for some £, then it will 
remain non active for all the rest of broad^"). This new dovetail technique is shown in Figure |l|. 
Observe that a node v during the execution of a time-slot of broad-b(2 ) could have been informed 
for the first time during a time-slot of the execution of broad-b(2^') for some t ^ £'. In this case, 
by definition of Protocol broad-a(-, •), the node v acts as an informed node. 

Theorem 4.5 For any positive constant a > 0, BROAd'"' completes broadcasting and terminates in 
0{DA log(n/A) log"*^"*"" n) time-slots on any n-node graph of maximum eccentricity D and maximum 
in- degree A. 

Proof. The execution of broad-b(2 "), for £„ = [log n] , will be the good one and it has completion 
time 0(Z)Alog(n/A) logn). By definition of BROAD^'^^, it follows that all the nodes turn to the 
non active state within the last time-slot t^nd = 0(i^Alog(n/A) logn) of broad-b(2^"). We thus 
need to upper bound the time in which this happens, i.e., when PHASE(A;e„d) is completed, where 
kend is the smallest integer such that k^nd > ^n and /^ (£„) > tend- From the definition of /^, it 
follows that 

kend < log 2 tend (7) 

Let T be the number of time-slots required to complete PHASE(A;end)- From the definition of phase 
and stage of the protocol, it holds that 

^end rC i "'end -'- 

fc=i 1=0 e=o 

It thus follows that 



k J— 1 

£=0 



end 



/ , y'^end 
£=0 



< 2 



end 



A-2 

'^end 



(8) 
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Figure 1: The figure refers to tlie case a = 2: the abscissa represents the executions of broad-b(2 ), 
while the ordinate represents the time-slots of broad-b(2 ). 
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Finally, by combining Ineq. ^ with Ineq. Q, we get T < tf.nd log" t^nd and so 

T = ©(DA log(n/A) log^+" n). 

U 

5 Multi-Broadcast: The BB model 



We first need to extend Definition ^J| to multi-broadcast operations. We assume here that each 
message has an header containing a unique ID number so that two messages have different ID 
numbers. 

Definition 5.1 ^ multi-DB (^multi-RBj protocol P is a protocol that, given a graph G and a set 
of r broadcast operations on it (in short, an r-hroadcast operation, r > 1), works in time-slots 
(numbered 0, 1, . . .) according to the following rules. 

1. In every time- slot, each node either acts as transmitter or as receiver. When transmitting, 
the node sends one message. 

2. All the nodes share the same message-recovering function TZ that takes any message m as 
input and, if any, returns the (unique) broadcast message contained in m. 

3. A node receives a message in a time-slot if and only if it acts as receiver and exactly one of 
its in-neighbors acts as transmitter in that time-slot. 

4. The actions of a node in a specific time-slot are function of its own label, the number of the 
current time-slot t, and the messages received during the previous time-slots (for multi-RB 
protocols, the actions also depend on the output of a random bit generators). 

5.1 The lower bound for deterministic protocols 

The next theorem provides a lower bound which is a function of the congestion c and n. 

Theorem 5.1 Let P be any multi-DB protocol. Then, for any n > A and c > 2, there exist an 
n-node directed graph G^ , with D = 3, A = 2, and an r-broadcast operation (with r > c) on G^ 
(yielding a congestion c), such that P completes this operation in 0((c/logc) logn) time. 

Proof. Without loss of generality we consider the case in which c is arbitrary fixed and r = c (i.e., 
maximum congestion). The graph G^ will be one of the family Qn described below. Any graph 
in Qn is an n- nodes directed graph of 3 levels. The first level consists of the (unique) source node 
(with label 1) containing all the r messages. The source is then connected to n — 2 nodes that form 
the second level. Finally, the third level has only one node (with label n), i.e., the sink. The sink 
has exactly two in-neighbors among the nodes in the second level. We denote by Gu,v the graph in 
which the sink has u and v as its in-neighbors. So, 

Gn = {Gu,v \ u,v e {2,3,... ,n-l}} 
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Since the sink cannot send any information to any other node, the execution of P, with respect to 
any non sink node, is the same for every graph of the family. Let T be defined as fohows 

T = max {t I P has completion time t on Gu,v} 

We represent the execution of the first T time-slots of P,with respect to to a node v in the second 
level, as a sequence x^ over the alphabet {0, 1, . . . , r} with the following meaning: x^(t) = z {z > 1) 
if, at time-slot t, v sends a message m such that TZ{m) is the z-th source message, where TZ is the 
message-recovering function. Furthermore, Xv{t) = if, at time-slot t, either v acts as receiver or 
it sends a message m such that Tl{m) is not defined. We thus obtain a set V^ of n — 2 sequences 
of length T over the alphabet {0, 1, . . . , r}. 

We claim that a necessary condition to complete the r-broadcast on every graph in Q is that any 
two sequences Xu and x^, with v ^ u, must be r-dijferent , i.e., for any element z G [r] there is a 
coordinate t for which the set {xu{t),Xv(t)} is equal to {2;,0}. 

Indeed, assume by contradiction that this is not true. So, there are two sequences Xu, x^ with 
V ^ u and z £ [r], such that {xu(t),Xy{t)} is not equal to {z,0} for every t <T. We then consider 
protocol P on the graph Gu,v It is easy to verify that, in Gu,v, the sink is reachable from the 
source, but the sink does not receive the z-th message during the first T time-slots. 
From the above discussion, the n — 2 sequences of length T in V^ must be pairwise r-different. 
From Theorem |3.6| , in order to have a set of n — 2 pairwise r-difference sequences of length T, it 
must hold that 



T = n[ logn 

Vlogr 

Since T is a lower bound on the worst-case completion time of P over the graph family Qn, the 
theorem follows. □ 

Theorem 5.2 Let P be any multi-DB protocol. Then, for any n > 4, 3 < D < n/6 and 2 < c < r, 

it is possible to define an n-node directed graph G^ with maximum eccentricity D and maximum 
in-degree A = 2, and a set of r independent broadcast operations on G (yielding a congestion c) 
such that P completes these operations on G^ in Q{c+ (c/logc + L')logn) time. 

Proof. Since a node can receive at most one message per time-slot, it is easy to obtain the lower 
bound Q{c). Then, by combining the family of graphs of Theorem |4.2| (with A = 2) with that 



yielding Theorem 5.1, we easily get the thesis. □ 



Notice that the proof of Theorem 5.1 does not rely on the fact that the nodes do not know n. Thus, 
Theorem ^^ also holds under this condition. 

Finally, we observe that the above contruction can be easily modified in order to let each of the r 
broadcast messages start from a different source (i.e. we have r messages in r different sources). It 



suffices to replace the source node in Qn (see the proof of Theorem 5A) with the root of a binary 
directed tree in which the r sources are the leaves of the tree. The only difference is that D is now 
a logarithmic function of n. 
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5.2 Lower bound for randomized protocols 

Any worst-case time bound of a randomized protocol can be considered reliable if it happens 
within a high probability on every possible instance. This concept is widely adopted in the field 
of randomized algorithms [ MR95| , and it can be easily adapted to the case of multi-broadcast 



operations on unknown networks. 

Definition 5.2 A multi-RB protocol P has reliable completion time T (where, clearly, T depends 
on n) if, for any n > 1 and for any n-node graph G, P completes, with probability at least 1 — 1/n 
any r-broadcast operation on G within T time-slots. 



Lemma 5.1 Let P be a multi-RB protocol for unknown networks. If P has reliable completion 
time T, then it must holds that T = r2((c/logc) logn). 

Proof. The proof makes use of the families Qn {n > 4) of directed graphs, and the corresponding 
multi-broadcast operations, which have been introduced in the proof of Theorem p.l| . In particular, 
we will show that, for any n > 4, a graph G^ € Gn exists on which P has ri((c/logc) logn) 



completion time with probability larger than 1/n. As in the proof of Theorem 5.1, an execution of 
T time-slots of P, with respect to the nodes of the second level of Gu,v, can be represented as a set 
D of n — 2 sequences of length T over the alphabet {0, . . . ,r} (with the same meaning of that given 



in the proof of Theorem |5.l| ). A multi-RB protocol (restricted to the nodes in the second level of 
Gu,v) can thus be seen as a probability distribution V over the set A of all possible sequence sets 
T). Consider the following function 

■jy, \ _ J 1 if sequences u and v inT) are r-different 
X i^' "^i - I otherwise 

From the hypothesis of the theorem we have that, for all u ^ v, 

Pr{P has completion time T on Guv} ^1 , 

n 

it follows that 

\^u^v ^P(P)x''(n,z;)>l-- (9) 



veA 



Consider now the sum 



U,VGL2 

where L2 denotes the nodes of the second level of any graph in G^, this equals the number of 
r-different pairs yielded by the protocol P. We now prove that there has to exist a D such that 

,.77 / 1\ (n-2)(n-3) , , 

M- > (1 - -) i \ 1 (10) 

In fact, from Ineq. pi, we have that 



21 



and, hence 

It follows that a P exists that verifies Ineq.^. We now prove that 2? must contain a large subset of 



sequences which are pairwise r-different, thus the same property derived in the proof of Theorem 5.1 



Claim. There exists a subset 5 C I? such that \S\ = n/2 and, for any u,v £ S (with 
u 7^ v), the pair {u,v) is r-different. 

Proof. The following simple algorithm finds the desired subset 5 (we assume here 
that n is an even number). 

begin 

Choose an arbitrary S CD s.t. \S\ — n/2 

S" := 0: 

while {S does not satisfy the claim) do 

begin 

Choose (arbitrarily) w, w e 5 that are not r-different 

Choose_ (arbitrarily) k€V\(SUS'') 

5:=(5-M)U{fc}; 

S" := S" U {v} 

end 

return iS. 
end 



We first notice that, from Ineq.lO, there are at most (n — 3)/2 different pairs in D 
that are not r-different. Since at every iteration of the while loop the algorithm 
"discards" a new not r-different pair in D, the algorithm always returns a set S 
satisfying the claim in 0{n) steps. 

The claim implies that the reliable multi-RB protocol P on the family Qn must yield a set of n/2 
sequences that are pairwise r-different. So, by applying the lower bound of Theorem |3.6| , we can 
state that 

T = n f^log^) = n (-^logn 
\ log r 2 J \ log r 

Since T is a lower bound on the completion time of P, the lemma follows. □ 

The proof of the following theorem is an easy consequence of the i}{D\og{n/D)) lower bound 
for randomized protocols given in |KM93|, the trivial lower bound f^(c), and Lemma ^.1| . 
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Theorem 5.3 Let P he any multi-RB protocol. Then, for any n > 4, D > 3 and 2 < c < r, 

there exist (i) an n-node directed graph G^ with maximum eccentricity D and maximum in-degree 
A = 2, and (ii) a set of r independent broadcast operations on G (yielding a congestion c), such 
that the reliable completion time of P is Vt{c+ {c/\ogc)\ogn + D\og{n/D)). 

The above theorem in fact holds for any probabihty lower bound of the form 1 — l/n"" (with 
any fixed constant a > 0). However, we choose the form 1 — 1/n in order to simplify the proof of 



Lemma 5.1. 



5.3 The multi-broadcast protocols 



As mentioned in the Introduction, our multi-DB protocols make use of superimposed codes [EFF85 
HS87 , 197 , |CHI99 |. In particular, we will use the following upper bounds | EFF85 , KS64 |. 



Theorem 5.4 For any n>2> and for k > 2, 

EFF89^ there exist (n, k)-strongly-selective families of size 0(min{n, fc^ logn}); 



KS64] (p- 370) it is possible to construct, in time polynomial in n and k, an (n, k)-strongly- 
selective family (based on q-ary error-correcting codesj of size 0(min{n, k'^ log n}). 



We recall that an Q((A:^/log fc) logn) lower bound is proved in Theorem |3.4| . 

In what follows, we describe the protocol MULTl-BB-BROAD-A(n, A): It assumes that nodes know 
A and n. However, when A and/or n are not known, we can adopt the same dovetail technique 
described in subsection 4.2. The cost of this further task is 0(log ^" n) additional time-slots (for 



any fixed a > 0) per each time-slot of MULTi-BB-BROAD-A(n, A). 

Description of Protocol MULTi-BB-BROAD-A(n, A). With each message is associated a priority 
so that the priorities induce a total ordering on the set of broadcast messages^. The priorities will 
be used by the nodes to schedule the messages to send. In fact, every node stores the messages 
by means of a priority queue. The protocol MULTi-BB-BROAD-A(n, A) uses an (n, A + l)-strongly- 
selective family J^ = {Fi,F2, . . . F\y^\}- It consists of a sequence of consecutive phases. Each phase 
consists of \J-'\ time-slots. At the very beginning, the priority queues of the source nodes contain 
their broadcast messages, the other priority queues are empty. At the beginning of each phase, 
every node v with a non empty priority extracts from the queue the message m„ of highest priority. 
At the j'-th time-slot of the phase, node v acts according to the following rules. 

- If the label of v belongs to Fj and ruy exists then v transmits m^. 

- In all the other cases, v acts as a receiver. If v receives a message that v has never 

received before, the message is enqueued, otherwise, the message is discarded. 

Theorem 5.5 Protocol MULTl-BB-BROAD-A(n, A) completes any r-broadcast operation within 0{(D+ 
c) min{n, A^ log n}) time-slots on any n-node graph of maximum eccentricity D, maximum in-degree 
A, and congestion c. 

*A possible choice for the priorities is the following. If m is the h-th message of a source of label I then the priority 
of m is given by the pair {l,h). Thus, for any two messages m and m' of priorities (l, h) and {I' ,h'), m has priority 
higher than m' if either I < I' or I — I' and h < h' . 
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Proof. Firstly, we prove the following 

Claim 1 If a node v transmits a message during a phase t then all the out-neighbours 
of V receive the message by the end of phase t. 

Proof. In fact, let u be any out-neighbour of v. By virtue of the (n, A -|- l)-strongly 
selectivity of J-', there is a time-slot of phase t in which u acts as a receiver and v is 
the only node, among the in-neighbors of u (notice that these are at most A), that 
transmits. Hence, in that time-slot, u receives the message of v. 

Now, we can show that all messages reach their destinations. 

Claim 2 After a finite number of phases, any message m is received by all the nodes 
that are reachable from the source of m. 

Proof. The proof is by induction on the distance i from the source. For i = the 
claim is obvious. Let u be a node at distance i + 1 from the source. Consider an 
in-neighbour f of u at distance i. From the inductive hypothesis, node v receives 
the message m. The first time this happens, m is enqueued in the priority queue of 
V. According to the protocol, at the beginning of each phase, node v extracts the 
message of highest priority and transmits it. Since there are less than r messages of 
priority higher than that of m and any message is enqueued at most once, it follows 
that V extracts and transmits m within at most r phases. Thus, from Claim 1 node 
u receives the message m. 

The last step consists of showing that the messages cannot be delayed too much. 

Claim 3 Assume that node u, at distance i from the source of a message m, receives 
m for the first time during phase i + t„. Then, during the first i + tu phases, u 
transmits at least t^ — 1 messages with higher priority than that of m. 

Proof. The proof is by induction on the distance i. For i = the claim is obvious. 
Let u he a node at distance i + 1 and let v be the node at distance i from which u 
receives m for the first time. Let i + ty be the phase in which node v receives m for 
the first time. By inductive hypothesis, in the first i + ty phases, v transmits at least 
ty — 1 messages of higher priority (than that of m). This fact and Claim 1 imply 
that, in the first i+ty phases, u receives from v a set Mi of at least ty — 1 messages of 
higher priority. Now, let /c > be the number of phases during which the message 
777. remains in the priority queue of v. This implies that v transmits a set M2 of 
k further messages of higher priority before transmitting m. As a consequence, u 
receives all the messages in M2 before receiving m. 

Let tu = ty + k; then node u receives m for the first time in phase l + l + tu- When 
this happens u has received from v at least t^ — 1 + A; = t„ — 1 messages of higher 
priority (i.e., all the messages in Mi U M2). Since all these messages have been 
received in distinct phases among the first i + t^-, the node u transmits at least tu — 1 
messages of higher priority in the first i +\ + tu phases. 
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Notice that any message m has to reach nodes at distance at most D from its source and there are 
less than c messages of priority higher than m that colhde with m. Hence, Claim 2 and Claim 3 
imply that any message reaches its destinations in less than D + c phases. Since each phase requires 
0(min{n, A^ logn}) time-slots (see Theorem. ^.4| ) the thesis follows. □ 



6 Conclusions and open problems 

The main contribution of this paper is that of providing explicit and strong connections between a 
set of old and new combinatorial tools and the issue of broadcast operations in radio networks of 
unknown topology. Thanks to such connections, we have obtained new lower and upper bounds on 
the completion time of this important operation. We believe that the concept of selectivity and that 
of r-different sequences can have further applications to other distributed models in which the local 
knowledge is extremely low. Some evidence of our opinion is also given by some previous results 
Lin92(| in which strongly-selective families have been used for the distributed coloring problem in 



unknown graphs. 

As for specific research directions, the following ones appear to be the most relevant. 
The upper bounds for the DB protocols are not efficiently constructible since the corresponding 
"small" selective families have been derived by using probabilistic arguments: it would thus be 
important to design an efficient deterministic algorithm that constructs such families. 
The lower bounds for multi-DB (and multi-RB) protocols are consequences of the combinatorial 
lower bound ^(1377 log n) on the length of n pairwise r-different sequences given in Theorem ^^. We 
do not know whether the latter is tight. As far as we know, the best upper bound is 0{r log n). The 



deterministic upper bounds in Theorem 5^ almost match the lower bound in Theorem ^^, when 
A = O(polylogn). An interesting future research goal is that of reducing the gap between upper 
and lower bounds for larger A. To this aim, we believe that a generalization of the r2(jj^logn) 
lower bound to d-wise r-different sequences could give a stronger lower bound that also depends 
on A. 

A further research direction is that of using the same combinatorial tools to investigate the 
issue of dynamical-fault tolerance in radio networks. Dynamical edge and node faults may happen 
at any instant, even during the execution of a protocol. Some results on this direction have been 
recently obtained in |CMS01c|. 
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